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Abstract 

Next-generation spectroscopic surveys will map the large-scale structure of the observable universe, 
using emission line galaxies as tracers. While each survey will map the sky with a specific emission 
line, interloping emission lines can masquerade as the survey’s intended emission line at different 
redshifts. Interloping lines from galaxies that are not removed can contaminate the power spectrum 
measurement, mixing correlations from various redshifts and diluting the true signal. We assess the 
potential for power spectrum contamination, finding that an interloper fraction worse than 0.2% 
could bias power spectrum measurements for future surveys by more than 10% of statistical errors, 
while also biasing power spectrum inferences. We also construct a formalism for predicting cosmo¬ 
logical parameter measurement bias, demonstrating that a 0.15-0.3% interloper fraction could bias 
the growth rate by more than 10% of the error, which can affect constraints on gravity from upcom¬ 
ing surveys. We use the COSMOS Mock Catalog (CMC), with the emission lines re-scaled to better 
reproduce recent data, to predict potential interloper fractions for the Prime Focus Spectrograph 
(PFS) and the Wide-Field InfraRed Survey Telescope (WFIRST). We find that secondary line identi¬ 
fication, or confirming galaxy redshifts by finding correlated emission lines, can remove interlopers in 
PFS. For WFIRST, we use the CMC to predict that the 0.2% target can be reached for the WFIRST 
Her survey, but sensitive optical and near-infrared photometry will be required. For the WFIRST 
[O III] survey, the predicted interloper fractions reach several percent and their effects will have to be 
estimated and removed statistically (e.g. with deep training samples). These results are optimistic 
as the CMC does not capture the full set of correlations of galaxy properties in the real Universe, 
and they do not include blending effects. Mitigating interloper contamination will be crucial to the 
next generation of emission line surveys. 

Key words: line:identification, galaxies:distances and redshift, large-scale structure of the universe, surveys, 
gravitation 
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1 Introduction 

A new generation of large-scale structure (LSS) sur¬ 
veys will come online in the next decade. In particu¬ 
lar, there is much activity on the spectroscopic front, as 
many ground-based surveys, including the Prime Focus 
Spectrograph (PFS) (Takada et al. 2014), FastSound 
(Tonegawa et al. 2015), the Hobby-Eberly Dark Energy 
experiment (HETDEX) (Hill et al. 2004, 2008), and 
the Dark Energy Spectroscopic Instrument (DESI) (Levi 
et al. 2013), expect to take data in the next several 
years, while the space-based surveys Euclid (Laureijs 
et al. 2011) and the Wide-Field Infrared Survey Telescope 
(WFIRST) (Spergel et al. 2013) plan to come online soon 
after. As more surveys are going deeper and wider to find 
more galaxies and collect better statistics on LSS, it is 
becoming more important to identify systematic effects 
that can contaminate cosmological parameter estimates. 
Moreover, in order to observe more galaxies at higher red- 
shift, these future surveys will acquire spectra with lower 
signal-to-noise ratio than were obtained by past projects 
such as the Sloan Digital Sky Survey (SDSS) (York et al. 
2000), which focused on relatively bright targets. SDSS 
galaxy spectra usually had many spectral features read¬ 
ily identifiable by eye, but future LSS surveys are likely 
to operate near the minimum acceptable signal-to-noise 
ratio for redshift determination using a single emission 
line. 

One systematic effect that could potentially contam¬ 
inate future LSS measurements is interloper contamina¬ 
tion. The Hubble expansion of the universe causes ex- 
tragalactic emission lines to appear redshifted. While 
the redshift of the emission line can identify the object’s 
Hubble velocity and distance, this effect can cause two 
lines with different rest-frame wavelengths from differ¬ 
ent distances to appear with the same wavelength, con¬ 
fusing the two lines. Interlopers can be problematic 
in power spectrum estimates as well as measurements 
of the correlation function, baryon acoustic oscillations 
(BAO), weak lensing, redshift-space distortions (RSD), 
scale-dependent clustering bias, and liigher-order corre¬ 
lations such as the bispectrum because interlopers intro¬ 
duce an extra source of correlation and dilute the exist¬ 
ing correlation, distorting the power spectrum. These 
distortions can contaminate constraints on cosmological 
parameters, including those for the growth rate of struc¬ 
ture. Interlopers are also a major consideration in in¬ 
tensity mapping surveys (Visbal & Loeb 2010; Pullen 
et al. 2014; Gong et al. 2014), where the data product 
is a low-resolution data cube (RA,Dec,A) and the prob¬ 
lem must be treated statistically instead of via object- 
by-object redshift classification. While it is possible to 


remove this distortion by measuring the interloper frac¬ 
tion in a small patch of sky, the required precision may 
be formidable in cases where the power spectrum mea¬ 
surement is very sensitive to the interloper fraction. 

Methods have been developed to identify interloping 
emission lines, including simple methods such as sec¬ 
ondary line identification and photometry (Kirby et al. 
2007) as well as more complex methods such as spectral 
template fitting. However, these methods are limited by 
the LSS survey parameters, including photometric bands, 
spectral coverage, survey depth, and (for ground surveys) 
atmospheric lines; thus, it is important to diagnose for a 
specific LSS survey how well interlopers can be identified 
within the data. 

In this paper, we construct a formalism describing how 
a power spectrum from a LSS survey is distorted by inter¬ 
loper contamination. This formalism is then used to de¬ 
termine the resulting bias within measurements of cosmo¬ 
logical parameters. We then calculate the interloper bias 
for the clustering bias of sample b g and the growth rate 
f g as a function of the interloper fraction. We find that 
for PFS and WFIRST, an interloper fraction > 0.2% will 
significantly distort power spectrum measurements over 
scales k > 0.01/iMpc -1 such that the fc-averaged power 
spectrum amplitude will shift by > 10% of the amplitude 
error. We also find that an interloper fraction greater 
than 0.3 (0.15)% can significantly bias growth rate mea¬ 
surements for PFS (WFIRST). 

To test if these biases will be exhibited in future sur¬ 
veys, we use the COSMOS (Ilbert et al. 2009) Mock 
Catalog (CMC) (Jouvel et al. 2011) to construct mock 
surveys for the PFS [O II] survey and the WFIRST Hct 
and [O III] surveys based on their flux sensitivity curves, 
for the cases of no spectral cleaning and with clean¬ 
ing using secondary line identification. Note that our 
analysis with the CMC is idealized in that it does not 
treat blended objects and is limited by the bank of SED 
templates in the catalog. We also made modifications 
in the CMC to better reproduce the properties of real 
objects in the Universe. But it does represent a “first 
look” at how serious the problem might be. Subject 
to these caveats, we determine that after secondary line 
identification, PFS will have an interloper fraction less 
than 0.2%, meaning interlopers may not be a great con¬ 
cern. For WFIRST, on the other hand, secondary line 
identification will only reduce some interloper fractions 
to 10-30%. We then determine photometric cuts using 
infrared photometric bands from WFIRST and optical 
bands from the Large Synoptic Survey Telescope (LSST) 
(LSST Dark Energy Science Collaboration 2012) that re¬ 
duce the interloper fractions to less than 0.2% at most 
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redshifts in the Ha survey. In the WFIRST [O III] survey, 
the interloper fractions in WFIRST are still very large, 
and deep spectroscopic fields will be needed to precisely 
measure the interloper fractions. Of course, deep sur¬ 
veys will also be needed for the PFS and WFIRST Ha 
surveys to confirm if interloper fractions for these sur¬ 
veys are indeed small. We also show that the WFIRST 
Y band depth will be necessary for interloper removal. 
Future LSS emission line surveys must consider how to 
implement these strategies to identify interlopers, as well 
as determine acceptable contamination levels for other 
cosmological parameters. 

The plan of our paper is as follows: in Sec. 2, we de¬ 
rive the interloper bias to the measured galaxy power 
spectrum and cosmological parameters as functions of 
the interloper fraction and redshift. We present methods 
to identify and remove interlopers in Sec. 3. In Sec. 4, 
we describe the mock surveys we construct for PFS and 
WFIRST using the CMC, and we give potential inter¬ 
loper fractions for PFS in Sec. 5 and WFIRST in Sec. 6. 
We state our conclusions in Sec. 7. Wherever not explic¬ 
itly mentioned, we assume a flat ACDM cosmology with 
parameters compatible with the Wilkinson Microwave 
Anisotropy Probe Seven-Year Data Release (WMAP7) 
(Jarosik et al. 2011). 

2 Interloper bias 

In this section we derive the effect of interlopers on the 
matter power spectrum measurement, as well as the bias 
introduced in cosmological parameter measurements. We 
consider a hypothetical survey that maps survey emission 
line galaxies (SELGs), detected through a survey emis¬ 
sion line (SEL) with rest-frame wavelength Asel, an d is 
contaminated by an interloping emission line with rest- 
frame wavelength Ai n t. Note in this section we assume 
only one interloper, but we can easily extend our analy¬ 
sis to multiple interlopers. Since emission line Asel from 
redshift zselg and emission line Ai n t at redshift z\ n t are 
observed at the same wavelength, each zselg has a cor¬ 
responding zi n t according to 

l+2int= SEL (1 + zselg) ■ (1) 

mnt 

2.1 Galaxy Power Spectrum 

We consider a galaxy power spectrum measurement con¬ 
structed from overdensity maps of number counts of 
galaxies, where each galaxy emits an emission line that 
appears to be a SEL with wavelength A = Asel(1 + 
zselg)- However, the number counts will include not 
only SELGs but interloping galaxies as well. Thus, the 


overdensity is written in terms of the total comoving 
number density nt = nsELG + n lnt in the form 


<5t(x) 


n t (x) n t 
n t 

^SELg(x) - n S ELG 
n t 

, nint (y) - ni n t 

' - ? 

n t 


( 2 ) 


where nj n t is the number of interlopers divided by the 
SELG volume element (not redshifted to zint), the barred 
quantities (n) are averaged over the survey area, and x 
and y are the 3D comoving position vectors of SELGs and 
interloping galaxies, respectively, y is the true position 
of an interloping galaxy at redshift zi n t such that it ap¬ 
pears to be at position x if it is assumed to be at redshift 
zselg- We can separate the position vectors in terms of 
their transverse (xj_) and radial (xy) components, and 
it can be shown that xj_ oc -D(zselg), the comoving dis¬ 
tance to redshift zselg, and xy oc (1 + zselg )AH(zselg) 
(similar expressions are true for y). Thus, with x be¬ 
ing the observed position of the interloper and y be¬ 
ing its true position, we define qj_ and qy such that 
(x_l, xy ) = (7_i_yj_,7||y||) with 


_ D(zselg) 

7± “ D(z lnt ) 

_ (1 + zselg)/77(zselg) 

7,1 (1 + Zint) / H (zint) 

_ A;[ n t-/L (.£l n t) 

Asel77(zselg) 


It is evident from Eqs. 1 and 3 that qj_ > 1 (qj_ < 1) for 
Aint > Asel (Ai n t < Asel). For qy, Ai nt > Asel causes 
H (zint) /H (zselg) < 1, and vice-versa, making the be¬ 
havior of 7 y less trivial. For Ai nt > Asel, we find qy > 1 
(qy < 1) for zsel < z A (zselg > z A ) where A = A Int / Asel 
and 


1 + za — \j ^ m -A(A+l), (4) 

for a flat ACDM universe. The opposite is true for Ai n t < 
Asel- 

We now assert the existence of an interloper fraction 
/, such that the number density of interlopers is / times 
the total number density of objects, or n\ n t = frit. Thus, 
/ = 0 is a map with no interloper contamination, and 
/ approaches unity as the interloper contamination in¬ 
creases. The quantity / is averaged over the sky and 
over each redshift bin, so that it does not contain LSS 
fluctuations and is not a random field. We write nt in 
terms of /, nsELG, and ni nt , according to 


_ uselg tll n t 

Ut ~ l~f ~ f ' 


(5) 


Thus, we can separate St into the overdensities of the two 
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sets of galaxies, in the form 

<5 t (x) = (l-/)<5 S ELG(x) + /<5int(y)- (6) 

A similar equation exists for the Fourier transform of 
this expression, except that the Fourier transform of the 
interloper term becomes 

<?nt s (k) = J eZ 3 xe lk ' x 5i„ t (y) 

= 7I711 J d 3 ye iq ' y <hnt(y) 

= 7j. 7|| ^int(q), (7) 

where q = (kj_ 7 _L,k|| 7 ||). This implies that the covariance 
of 5fn t s (k) is given by 

(Sint (k)< 5 I *° t bs (k , ))=( 7 i 7 ||) 2 Pi nt (q)fo(q - q') 

= 7 i_ 7 ||-Pint (q)^D (k — k 7 ), ( 8 ) 

where 5 d(x) is a 3D delta function. 

Since the two sets of galaxies are at very different red- 
shifts, their overdensities should be uncorrelated. In par¬ 
ticular, of the cases considered in this paper, the most sig¬ 
nificant interloper effect is Ha and [O III] (AlnA = 0.27), 
implying that the redshift separation of the contaminants 
is Aln(l + z) = 0.27. 1 Note that gravitational lensing 
could induce small correlations (Raccanelli et al. 2013a, 
2015a) between the SELGs and the interlopers, though 
we will neglect them in our analysis. Thus, the total 3D 
galaxy power spectrum Pt(k,p), including redsliift-space 
distortions (RSD) with p = fey/fc, can be written as the 
sum of the two individual components, according to 2 

Pt(f\k,p, zselg) 

=(1 — f) 2 PsELa(k, p, zselg) 

+f 2r y±l\\Pint[q(k,p),pg(p),zint ], (9) 

where 7 (p) = “ P 2 ) + 7 y P 2 , q(.k, p) = 7 (p)k, 

p q {p) =7||A t /7(^)> and T’selg {k, p, zselg) is given by 
(Kaiser 1987; Hamilton 1998) 

PsELG(k,p,ZsELG) = (1 p Pp 2 ) 2 Pg,r , (10) 

where P g , r is the power spectrum in real space and 
ft = fg/bg, with b g being the galaxy bias relating visible 
to dark matter halos. Strictly speaking, the shot noise 
will also vary with the interloper fraction. But since it is 
usually fitted and subtracted from the measured power 

1 An exception to this argument, where we would have to consider the cor- 
relation between the SEL and interloper density fields, would occur if 
Asel ~ Ajnt■ This does not occur in this paper, but it does occur in 
the case of Si III 1206 A contamination in the Lyman-a forest, where the 
target and contaminating lines are separated by only 2300 km/s; see e.g. 
McDonald et al. (2006). 

2 This expression was shown earlier in a private communication with 

D. Eisenstein. 


spectrum, we only consider the interloper bias to the clus¬ 
tering power spectrum. 

In Figs. 1 and 2, we plot the biased, spherically aver¬ 
aged, linear clustering power spectrum due to interlopers 
for various redshifts and values of A along with errors 
predicted for the PFS survey assuming no contamination. 
For the clustering bias, we assume b g (z) = 0.9 + 0.42:, a fit 
(Takada et al. 2014) to semi-analytic models (Orsi et al. 
2010) at the PFS flux limit that compares well with data. 
According to this model, lower flux limits can increase b g 
by ~ 0.5, and we find that the change in the fractional 
interloper distortion to the power spectrum due to in¬ 
creasing b g by one is much smaller than the PFS errors 
for small interloper fractions (/ < 2%). The same should 
be true for distortions to cosmological parameters. 

In Table 1, we show the bias-to-noise ratio (BNR), 
which is the ratio of the interloper bias of P(k) to its 
statistical error, averaged over the interval k = 10~ 4 — 
0.1/iMpW 1 . We find that an interloper contamination 
/ > 0.25% would bias the overall power spectrum signal 
relative to the P(k ) errors (BNR > 0.46), such that the 
total measurement errors ertot = <Tpy/ 1 + BNR 2 for PFS 
would increase by more than 10%. For WFIRST, this 
occurs if / > 0.2%. For / > 0.5% in either survey, we 
have BNR > 1, which would be catastrophic for power 
spectrum measurements. Thus, throughout our analysis 
/ < 0.2% will be our target interloper fraction; but note 
that if we have a significant interloper fraction that can 
be measured within ±0.2%, then we can accurately model 
the power spectrum of the SELGs well enough to account 
for it, particularly if the interloper contribution is small. 
Note that the PFS and WFIRST designs have continued 
to evolve; we use the point designs in Takada et al. (2014) 
and Spergel et al. (2013) but note that the analysis herein 
will have to be revisited for the final specifications. 

Nonlinear clustering could in principle modify our 
predictions on small scales. Several prescriptions exist 
for describing nonlinear clustering, including HALOFIT 
(Smith et al. 2003), convolution lagrangian perturba¬ 
tion theory (Carlson et al. 2013), effective field the¬ 
ory (Cheung et al. 2008), renormalization methods 
(Scoccimarro 1998), and the halo model (Crocce & 
Scoccimarro 2006; Cooray & Sheth 2002). In addition, 
nonlinear peculiar velocities can produce “fingers of god” 
(FoG) effects (Jackson 1972) in the power spectrum. 
More detailed work has been considered in the litera¬ 
ture regarding nonlinear biasing (Matsubara 2008; Jeong 
& Komatsu 2009; McDonald & Roy 2009; Baldauf et al. 
2012; Chan & Scoccimarro 2012; Nishizawa et al. 2013; 
Saito et al. 2014; Vlah et al. 2015) and nonlinear RSD 
(Scoccimarro 2004; Matsubara 2008; Taruya et al. 2010; 
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Reid & White 2011; Okumura et al. 2012b, 2012a; White 
2014; Okumura et al. 2015), yet here we will only con¬ 
sider a basic nonlinear clustering model. Specifically, as 
an example we consider the nonlinear power spectrum 
using the HALOFIT prescription computed from CAMB 
(Lewis et al. 2000) along with a FoG damping term given 
in Eq. 10 of Blake et al. (2011) with <r v = 2 Mpc /h. We 
find that the change in the fractional interloper distor¬ 
tion to the power spectrum due to nonlinear clustering 
is much smaller than the PFS errors for small interloper 
fractions (/ < 2%). 

For the specific purpose of constraining the effects of 
interlopers on BAO, we computed the BAO shifts for 
each of the cases in Table 1 (i.e. each combination of 
zsel , A, and /). In each case, we computed the linear 
power spectrum, with interlopers, at each value of p from 
0 to 1 in steps of 0.1, and did a template fit as in Seo 
et al. (2008), Eq. 1. The fits used a quadratic polyno¬ 
mial for B(k), a 7th order polynomial for A(k), and used 
the range of wavenumbers 0.02-0.3 h/ Mpc. Fits were 
performed with shot noise levels appropriate for both 
nP = 0.2 and 2 (measured at fc = 0.2/i/Mpc); see Blazek 
et al. (2015) Eq. B3 for the explicit equation. The shift 
in the BAO scale is parameterized by a, which re-scales 
the positions of the BAO features; a = 1 in the fiducial 
cosmology with no biases, but a > 1 (a < 1) indicates that 
the as-measured BAO ruler is shorter (longer) than the 
fiducial case. For the four combinations of zsel and A, 

the largest shifts jctwith interlopers 0: no interlopers! Com¬ 
puted are at most 0.25% for / = 0.02, 0.06% for / = 0.01, 
0.016% for / = 0.005, and 0.004% for / = 0.0025. Thus 
the BAO appears to be more robust against interlopers 
than the broadband power spectrum. The reason for this 
is that while interlopers dilute the power spectrum by a 
factor of 1 — / (see Eq. 9), this dilution has no effect on 
the power spectrum shape; only the f 2 term coming from 
the clustering of the contaminants moves the BAO fea¬ 
ture. It is thus both expected and numerically confirmed 
that the error in BAO measurements scales as oc f 2 . We 
also see that the BAO peak shift is negligible, even at the 
tenth-of-a-percent level of interest to future surveys, for 
/ < 0.01. 

However, although a pure dilution (i.e. adding un¬ 
clustered fake sources into the sample) would also not 
effect the measured RSD parameter /3, it would reduce 
the effective clustering bias of the sample. This would in 
turn reduce the measured growth rate f g =/3xb g , which 
plays a large role in constraining modified gravity. This 
is an example of the importance of a precise knowledge 
of the interloper fraction. 




Fig. 1. The predicted ratio between the measured 3D spherically-averaged 
galaxy power spectrum for A = 2 (interlopers with longer wavelengths) 
assuming various levels of interloper contamination and the 3D power 
spectrum with no contaminations, along with lcr errors bars predicted for 
the PFS survey, where A = Aint/AsEL- We plot the cases / = 2% (solid), 
f = 5% (dotted), / = 10% (dashed), and / = 20% (dot-dashed). This 
plot shows that interlopers could significantly bias (up or down) power 
spectrum measurements for PFS. 


Table 1. The interloper bias-to-noise ratio (BNR) for P(k), 
marginalized over k = 10 -4 — 0.1/iMpc -1 . The first 4 rows as¬ 
sume PFS errors, while the last row assumes WFIRST errors. 


/ 

A = 2 

z = 1.2 

A = 2 

z = 2.0 

A = 0.5 

0=1.0 

A = 0.5 

z = 2.0 

0.25% 

0.38 

0.39 

0.32 

0.39 

0.5% 

0.78 

0.81 

0.65 

0.82 

1% 

1.6 

1.6 

1.3 

1.7 

2% 

3.1 

3.3 

2.6 

3.4 

0.2% 

0.42 

0.44 

0.39 

0.45 
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Fig. 2. The predicted ratio between the measured 3D spherically-averaged 
galaxy power spectrum for A = 0.5 (interlopers with shorter wavelengths) 
assuming various levels of interloper contamination and the 3D power 
spectrum with no contaminations. The format is similar to Fig. 1. In this 
case, the volume distortion to the interloper power spectrum is small, 
causing the effect to the total power spectrum to be mainly a dilution of 
the true power spectrum. 


2.2 Cosmological Parameters 

In order to calculate the bias on cosmological parame¬ 
ters due to interloper contamination, let us call the mea¬ 
sured power spectrum di{f) = Pt(f\ki). In this deriva¬ 
tion, we will calculate a separate bias for each redshift 
bin, since the measurements in each redshift bin should 
be uncorrelated. From this assertion, we can construct a 
chi-squared, assuming Gaussian errors, of the form 

X 2 (P) = [d(/) - d(p)] T C“ 1 [d(/) - d(p)], (11) 


cial cosmological parameters as 

di(p) =d ij0 + J ia Ap a , (12) 

where Ji a = ddi/dp a , di , 0 = Px(fo), and di , 0 and Ji a 
are evaluated at the fiducial parameter values. However, 
since d(p) yfd for /yt0, interloper contamination changes 
the location of the \ 2 minimum, producing a bias Ap to 
any parameter estimations. Note that this formalism is 
strictly true for small distortions in the power spectrum. 
For large interloper fractions, our first-order expansion is 
insufficient and a Markov Chain Monte Carlo (MCMC) 
analysis is necessary to find the best-fit parameters upon 
interloper contamination. 

By writing the change in the power spectrum, evalu¬ 
ated at the fiducial parameters, as Ad = d(/) — d 0 , the 
resulting expression for x 2 is given by 

X 2 (p) = [Ad - JA P ] r C- 1 [Ad - JAp]. (13) 

Optimizing this expression to find the minimum value, 
we find 

J T C _1 JAp = J T C -1 Ad. (14) 

Recognizing the expression for the Fisher matrix F = 
J T C _1 J and defining AD = J T C _1 Ad, we solve for the 
parameter bias 

Ap = F _1 AD. (15) 


While the matrix multiplications are strictly sums over 
wavevector bins, we can approximate them as integrals 
(Tegmark 1997; Seo & Eisenstein 2003). We use the same 
formalism as in Takada et al. (2014), where F is given by 
their Eq. (5) with no sum over redshift, and RSD effects 
are included in the power spectrum. However, unlike in 
Takada et al. (2014), we use the full 3D power spectrum, 
not the observed P(fcii,A;j_) model that is valid only for 
BAO fitting (Seo & Eisenstein 2003). As is generally the 
case in Fisher analyses, we neglect correlations between 
bandpowers of the power spectrum due to nonlinear clus¬ 
tering or survey geometry, which slightly underestimates 
the errors. AD, similarly, is given by 


AI>a(zi)= / dp, 


2irk 2 dk ir n N 
2(2tt)3 


;; AP t (f\k,p,Zj) d\nP g , s (k,p,Zi) 

Pg, s (k,p,Zi) dp a 

x exp [—fc 2 Ej_ — fc 2 /z 2 (Ey — Ej_)] 


(16) 


where p is a vector denoting the cosmological parameter 
values, di( p) is the predicted power spectrum with pa¬ 
rameters p, and Cij is the covariance matrix for power 
spectrum measurements at wavenumbers fc, and kj . The 
model d(p) can be expanded to first order from the fidu¬ 


using the notation of Takada et al. (2014) where 
Pg,s(k,p,z) = PsEhaik, p, z), V eS is the effective vol¬ 
ume, and the exponential factor accounts for the non¬ 
linear BAO smearing. The expressions and BAO smear¬ 
ing values for V e s, Ey, and Ej_ are given in Eqs. 6-8 of 
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Takada et al. (2014). We integrate over the wavenumber 
range (fc m in,fcmax) = (10~ 4 ,0.5)fc/Mpc. Although fc max = 
0.5/i/Mpc may seem a bit high, the exponential BAO 
smearing factor suppresses information naturally from 
smaller scales. Additionally, in order to set conservative 
systematic requirements from interloper contamination, 
an “optimistic” estimate of statistical errors (i.e. large 
fcmax) should be chosen. APt(f\k,p,z) corresponds to Ad 
and is just AP t (f\k,p,z) = P t {f\k,p,z) - P S ELG(k,p,z). 
We can also split AD into two pieces according to 

AD= [(l-/) 2 -l]AD° + / 2 AD Int , (17) 

where 

A n, , f 1 , f k ™*2itk 2 dk rr 
AD a (zi)=J dp, J 2(27t)3 ft, Zi) 

„ d\nPg, s {k,fi,Zi) 

dp a 

xexp[-fc 2 Si-fcV(Sf-Ei)] (18) 

and 

T , r 1 max OTj-fcZjh, 

A D^(zi)=J dp J'^ £^-^ VeS (k,p, Zi ) 

y l\l\\PfL[q( k ,P),Pg(p),zint(zi)\ 

Pg,s(k,p,Zi) 
din Pg, s {k,p,Zj) 
dp a 

x exp [—fc 2 E(]_ — fc 2 /x 2 (Ey — Ej_)] ■ 

(19) 

Using these expressions and an interloper fraction esti¬ 
mate, we can predict the exhibited bias for any cosmo¬ 
logical parameter estimate per redshift due to interloper 
contamination for any LSS survey. 


2.3 Example: Growth Rate Bias 

In this section we focus on the effect of interlopers on 
measurements of growth rate parameters. In the formal¬ 
ism presented above (Eq. 10), the growth rate parameter 
f g enters in the RSD parameter f5. The growth rate pa¬ 
rameter f g is a key ingredient in our understanding of 
the correct cosmological model, as it is the logarithmic 
derivative of the linear growth factor, D(a) oc 5m, with 
respect to the scale factor a: 
d\nD 


fg = 


dlna 


( 20 ) 


In most cosmological and gravity models, f g can be pa¬ 
rameterized as (Linder 2005): 


fg = 0 m (a) 7 , 


( 21 ) 


where 7 is a parameter that is different for different cos¬ 
mological models: in the standard ACDM+GR model it 


is a constant, 7 « 0.55, while it is 7 « 0.68 for the self- 
accelerating DGP model [see e.g. Linder (2005)]. In some 
other cases, it is a function of the cosmological parame¬ 
ters or redshift. 

If one assumes that general relativity is the correct 
model for describing gravity, then the parameterization 
of Eq. 21 can be used to test cosmological parameters 
describing, e.g. the dark energy equation of state. I 11 
general, measuring f g gives strong constraints on the 
model of gravity, and it is the most popular measure¬ 
ment for testing general relativity and constraining mod¬ 
ified gravity models [see e.g. Guzzo et al. (2008); Linder 
(2008); Percival & White (2009); Song & Percival (2009); 
Stril et al. (2010); Blake et al. (2011); Zhao et al. (2012); 
Samushia et al. (2012); Raccanelli et al. (2013b); Reid 
et al. (2014); Sanchez et al. (2013); Samushia et al. 
(2013); Huterer et al. (2013); Samushia et al. (2014); 
Beutler et al. (2014); Raccanelli et al. (2015b); Zhao et al. 
(2015)]. 

Interlopers can in principle affect all cosmological pa¬ 
rameters, in different ways. This also means that de¬ 
generacy between them in a large Fisher matrix analysis 
could change due to interloper effects. A detailed study 
of it is beyond the scope of the present paper, and we 
leave it to future work. 

We plot in Figs. 3 and 4 the shift of the ( b g ,f g ) mea¬ 
surement due to the interloper fraction. Note that we 
keep all other cosmological parameters constant, which 
underestimates the errors. However, we do not expect 
this to affect our results because the b g -f g degeneracy 
is much larger than degeneracies with other parameters. 
Based on PFS errors, we can show that an interloper 
fraction > 0.3% can bias the growth rate measurement 
such that the error on f g increases by more than 10 %. 
For WFIRST, the interloper fraction must be < 0.15% to 
not significantly bias growth rate measurements. These 
target interloper fractions should be robust to nonlineari¬ 
ties and changes in the clustering bias, while correlations 
between band powers could increase the targets slightly. 


3 Methods to Remove Interlopers 

Since interlopers can greatly distort the power spectrum, 
future surveys will require methods to identify them. In 
principle this can be done at the level of individual galax¬ 
ies, or through statistical means. There are various sta¬ 
tistical methods for removing interlopers, including cross¬ 
correlation methods (Newman 2008; Menard et al. 2013) 
and spectroscopic deep fields. Cross-correlation meth¬ 
ods, where one correlates the survey map with another 
map with unambiguous redshifts, measure the product 
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Fig. 3. The shift in clustering bias and growth rate estimates due to 
interlopers for A = 2 (interlopers with longer wavelengths), where 
A = Ax n t/AsEL- We assume the fiducial parameters ( b g ,f g ) based on our 
fiducial model and mark the measured parameters for / = 0 (circle), 

/ = 0.25% (triangle), / = 0.5% (square), / = 1% (diamond), / = 2% 
(pentagon), and / = 5% (hexagon). We also show our predicted lcr PFS 
error ellipses for these parameters. This plot shows that interlopers could 
significantly contaminate these parameter measurements for PFS. 


of bias x/ without identifying the interloping galaxies. 
Spectroscopic deep fields would observe a representative 
sub-sample of the emission line galaxies to much higher 
signal-to-noise ratio - and probably across a broader 
wavelength range in a multi-instrument campaign - in 
hopes of detecting secondary lines for many more galax¬ 
ies. We focus on methods that apply to individual galax¬ 
ies here, and consider the statistical methods as a second 
line of defense that will be used to mitigate the residual 
interloper contamination in the catalog after galaxy-by¬ 
galaxy cleaning methods have been applied. 


Fig. 4. The shift in clustering bias and growth rate estimates due to 
interlopers for A = 0.5 (interlopers with shorter wavelengths). The format 
is similar to Fig. 3. 

3.1 Secondary Line identification 

One method to distinguish SELGs from interlopers is sec¬ 
ondary line identification. When a LSS survey measures 
an emission spectrum, it detects various atomic, ionic, 
and molecular emission lines. Since these lines will all 
have the same redshift, the ratios of the line wavelengths 
with each other should be the same as the ratios in the 
rest frame. Thus, we can identify a pair of emission lines 
by the ratio of their wavelengths before we know the emit¬ 
ter’s redshift. We can use this method to identify SELGs 
as well as interlopers. This method is immune to con¬ 
tamination by other pairs of lines if no two sets of line 
pairs have the exact same wavelength ratio, which is true 
of the strong lines in the optical and near-infrared (NIR) 
spectral ranges we consider. 3 

This method is limited for two reasons. One is that a 

3 There are examples of line ratios that are almost the same; for example, 
He II 1640 : Lya 1216 = Ha 6563 : H/3 4861. However neither He II 
1640 A nor Lya is a significant interloper for far-red/NIR surveys at the 
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line Z that works as a satisfactory secondary line for an 
emission line candidate at some redsliifts will fall off the 
spectral range at other redshifts. Another reason is that 
we cannot always detect every emission line. If there was 
a line A z that always appeared with the survey emission 
line, then it would be necessary for any spectra with a 
SELG candidate to have a corresponding line Z with the 
right wavelength. However, not all emission lines will 
have a sufficient signal-to-noise ratio (SNR) to register 
as an ELG, especially when one of the lines intersects a 
sky line. Thus, using this method will inevitably lead 
to some SELGs being rejected because its corresponding 
line Xz did not appear in the spectrum. It will also lead 
to interloping ELG candidates that are not SELGs be¬ 
ing accepted because its corresponding line A z did not 
appear in order to rule it out as a SELG. For each sur¬ 
vey, we must assess which lines can consistently serve 
as secondary lines for SELGs and their interlopers, as 
well as determine how often the secondary line test fails 
as a function of observed wavelength, or equivalently of 
^SELG- 

For secondary lines, we will take the conservative ap¬ 
proach by requiring the secondary line to have a SNR 
high enough to prevent a statistical fluctuation from mas¬ 
querading as a secondary line. To determine the nec¬ 
essary SNR, we compute the probabilities Pi and P 2 , 
where Pi = P(notaccepted|real) is the probability that a 
real line is rejected because it does not exceed our chosen 
SNR cutoff and P 2 = P(accepted|notreal) is the proba¬ 
bility that a statistical fluctuation in the spectrum is ac¬ 
cepted as a real line because the fluctuation was higher 
than the SNR cutoff. Assuming Gaussian fluctuations 
from an expected flux F e and a flux cutoff F c , we find 
the two probabilities are given by 

Pi (not accepted|real)=^ 

P 2 (accepted|not real)=i 


L — erf 
L — erf 


Qe Qc 
Qc 

V2 


( 22 ) 


where F n is the flux noise of the instrument, Q e = F e /F n , 
and Q c = F c /F n . As expected, increasing the cutoff Q c 
increases Pi, the rate of rejected true lines, and decreases 
P 2 , the rate of accepted false lines. I 11 general, we re¬ 
quire secondary lines to have a SNR greater than Q c = 4, 
making the number of statistical fluctuations accepted as 
true lines negligible at the expense of eliminating a fair 
amount of true lines. 

We relax the required SNR for the secondary line 
[O III] 4959 when identifying [O III] 5007 to Q c = 1 
(P 2 = 16%; see Eq. 22) since we know the line ratio 


depths being considered for PFS or WFIRST, as they would have to be at 
extraordinarily high redshifts. 


P e (5007A) : P e (4959A) = 3 from atomic physics (Storey 
& Zeippen 2000). Setting the [O III] 5007A line as the 
primary line, we can set the minimum value for Q e for 
the secondary line [O III] 4959A equal to 

„ SNRmin (5007A)P n (5007A) , 00 , 

3P44959A) ■ 1 

It should be noted that when the 5007A line is the inter¬ 
loper, secondary line identification will cause the number 
counts of the [O III] 5007A interloper to be multiplied by 
a factor of Pi and the number counts of [O II] emitters 
to be multiplied by (1 — P 2 ). However, when the 5007A 
line is the SELG, as in WFIRST, secondary line identifi¬ 
cation will cause the number counts of interlopers to be 
multiplied by a factor of P 2 and the number counts of 
[O III] lines to be multiplied by (1 — Pi). Thus, since Pi 
and P 2 are anti-correlated, our goal for each case is to set 
Qc such that the first probability is minimized without 
decreasing the second probability so much that the shot 
noise from the SELGs gets too big. 

3.2 Photometry 

We can also use photometry to rule out candidates 
for SELGs. As the spectrum of an object redshifts, 
its colors, or brightness differences between neighbor¬ 
ing photometric bands, will traverse through color 
space. Thus, an object’s photometric colors can de¬ 
termine its photometric redshift. Photometric redshift 
samples have been constructed in numerous surveys, 
e.g. the Sloan Digital Sky Survey (SDSS) (White 
et al. 2011), the Canada-France-Hawaii Telescope Legacy 
Survey (CFHTLS) (Hildebrandt et al. 2012), COSMOS 
(Ilbert et al. 2009). Techniques to determine photo¬ 
metric redshifts will also be used in weak lensing sur¬ 
veys such as Dark Energy Survey (DES) (Sanchez & Des 
Collaboration 2010), Hyper Suprime-Cam (HSC) 4 and 
LSST (LSST Dark Energy Science Collaboration 2012), 
as well as Euclid and WFIRST. 

In this study, we do not attempt to construct a scheme 
for determining photometric redshifts. Instead, we deter¬ 
mine photometric cuts that separate SELGs from inter¬ 
lopers. As an example, let us consider an [O III] emission 
line contaminating an Ha survey. The [O III] interlopers 
will come from a higher redshift than the Ha galaxies, 
which imply that for a small enough redshift range, they 
may inhabit different locations in color space, or differ¬ 
ent color loci. If this is the case, then a photometric 
cut that separates the two color loci can be determined, 
and this photometric cut can be applied to SELG candi¬ 
dates in the survey to identify interlopers. The redshift 

4 http://sumire.ipmu.jp/en/2652 
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ranges used for this method must be small to prevent the 
color loci from overlapping. In our analysis we determine 
photometric cuts within two-dimensional slices of color 
space, although it is possible to reach better accuracy by 
determining cuts within the full multi-dimensional color 
space. 

4 Mock Surveys 

We assess the performance of the secondary line identi¬ 
fication and photometry methods in reducing the inter¬ 
loper rate /(Asel — Ai n t,zsELG), the fraction of galaxies 
identified in a survey using emission line Asel that are 
actually interlopers with emission line Ai nt , given by 

/(Asel — Ai nt ,zsELG) 

_ _ -Afint(zint) _ (24) 

-^SELgC^SELg) + -Nl nt (zi nt ) ’ 

where IVselg(zselg) is the number of galaxies at 
redshift .zselg identified by emission line Asel, and 
IVint (^int) is the number of galaxies at redshift z\ n t sat¬ 
isfying Eq. 1 such that its emission line Ai n t interlopes 
the SEL. This definition of / is equivalent to Eq. 5. We 
seek interloper rates for three cases: (1) we include all 
interlopers with fluxes great enough to pass the survey’s 
detection criterium, (2) we include all interlopers from 
case (1) that also fail to be identified by the survey’s sec¬ 
ondary line identification test for the interloping ELG, 
and (3) we include all interlopers from case (2) that fail 
to be identified using photometry. 

We evaluate /(Asel — Ai n t, zselg) by tabulating num¬ 
ber counts from the COSMOS (Capak et al. 2007; Ilbert 
et al. 2009) Mock Catalog (CMC) (Jouvel et al. 2011) for 
each potential interloper. The COSMOS survey is a com¬ 
bination of various surveys which, after cutting out ar¬ 
eas masked due to bright stars, together include 538,000 
galaxies over 1.24 deg 2 . The CMC was constructed by 
converting the UV magnitudes of the galaxies to star for¬ 
mation rates to [O II] luminosities using the Kennicutt 
relations (Kennicutt 1998). The SFR-UV calibration is 
based on the Salpeter stellar initial mass function (IMF) 
(Salpeter 1955), which would be modified for more mod¬ 
ern IMFs, e.g. Chabrier (2003), Also, the error on the 
[O IlJSFR calibration is 30%, mainly due to the different 
values for different stellar types. The [O II] luminosities 
are then converted to other emission lines, including Ha, 
H/l, and [O III] , using measured line ratios, many of 
which are uncertain and out-of-date. Thus we do not ex¬ 
pect the CMC to perfectly reflect reality, but it serves as 
a useful first look at expected interloper rates for future 
surveys. 

We also recalibrate the CMC to account for updated 


luminosity functions (LFs). Note that many of the emis¬ 
sion lines were recently updated (Zoubian et al., in prep.). 
The Ha lines in the CMC were calibrated to the LF in 
Geach et al. (2010). The other line luminosities were 
scaled from Ha using measured line ratios (Ilbert et al. 
2009). Thus [O III] luminosities are thus also calibrated 
to the Ha LF, but this is particularly dangerous given 
the large observed variation in [O III] /Ha ratios. We re¬ 
calibrate the Ha and [O III] lines to be consistent with 
the recent LFs from Colbert et al. (2013). Specifically, 
we use the Colbert LF for [O III] with a fixed. Also note 
that the [O III] LF in Colbert et al. (2013) applies to the 
[O III] 5007 flux only, not the total doublet flux. The re¬ 
calibration is performed for each line by first comparing 
the number densities n(> L ) based on the old and new LF 
then transforming each emission line galaxy’s luminosity 
such that n(> L) matches the new LF. Specifically, we 
compare n(> Ln a ) from Geach et al. (2010) to n(> Lhq) 
and n(> Loin) from Colbert et al. (2013) to recalibrate 
Ha and [O III] 5007. We performed these recalibrations 
in redshift bins of Az = 0.1. We also recalibrate [O III] 
4959 by setting the intrinsic line ratio L 5007 /L 4959 = 3. 

I 11 order to show that our re-calibrated emission lines 
indeed have emission properties consistent with Colbert 
et al. (2013), we plot equivalent width (EW) distribu¬ 
tions for Ha and [O III] emitters within a mock sam¬ 
ple produced using the CMC corresponding to the HST 
WFC3 Infrared Spectroscopic Parallels (WISP) survey 
(Atek et al. 2010), which was used to determine the lu¬ 
minosity functions in Colbert et al. (2013). Note that, 
similar to Colbert et al. (2013), we require the Ha and 
[O III] lines to have a SNR > 5, where the spectral noise 
is given in Fig. 5 of Atek et al. (2010) according to the 
WISP survey. We plot the distributions both before and 
after re-calibration. In Fig. 5, we see better consistency 
after re-calibration than before with the EW distribu¬ 
tions in Figs. 5 and 6 of Colbert et al. (2013). However, 
we do see significant differences for the [O III] sample at 
high redshifts. Assuming our calibrations were performed 
correctly, this would imply that the continuum measure¬ 
ments in the mock catalog are not consistent with those 
from the WISP survey. These results suggest that ap¬ 
proximately 40% of galaxies may be affected. 

As another check, we also compare the intrinsic 
Ha/H/3 line ratio before and after re-calibration, tak¬ 
ing into account the galactic extinction. Before re¬ 
calibration, we find the mean line ratio to be 3.70, while 
after re-calibration, the ratio decreases to 2.92, which is 
much closer to the expected value of 2.86 based on atomic 
physics predictions (Dopita & Sutherland 2003). 

We also plot in Fig. 6 the average radius binned over 
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Fig. 5. The equivalent width (EW) distributions of Ho: (top) and [O III] 
(bottom) emitters in a mock WISP sample constructed from the CMC. 
The dashed (solid) histograms are the distributions before (after) 
re-calibration to LFs in Colbert et al. (2013). The dash-dotted histograms 
are from Colbert et al. (2013). The EW distributions from the re-calibrated 
sample show better agreement with Colbert et al. (2013) than the 
un-calibrated sample, yet there are differences, particularly in the [O III] 
sample at high EW. 

Note that plots in Colbert et al. (2013) and our plot 
include Fpjii] = 0.4Thc« in the Ha flux, while [O III] in 
Colbert et al. (2013) and our plot refers to the [O III] 
5007 line only. 

Ha and [O III] flux for all the objects in this mock WISP 
sample, both before and after re-calibration, in an at¬ 
tempt to reproduce Fig. 11 in Colbert et al. (2013). Our 
error bars are much smaller than those in their Fig. 11 
because we include objects over the whole COSMOS field, 
which is ~ 37x larger than the WISP field. Our results 
are mostly consistent with their estimates of the radius- 
flux relations for the two emitters except for the 4th flux 
bin for the [O III] emitters, in which our average radius 
is higher than their estimate by about 3cr. However, our 
[O III] radius-flux relation after calibration is closer than 




Fig. 6. The radius-flux relation for Ha (top) and 0111 (bottom) emitters 
in a mock WISP sample constructed from the CMC. The dashed (solid) 
points are the distributions before (after) re-calibration to LFs in Colbert 
et al. (2013). The black triangles are the points from Colbert et al. (2013). 
The radius-flux relation derived from the re-calibrated sample mostly show 
agreement with Colbert et al. (2013). Note that plots in Colbert et al. 
(2013) and our plot include Tnii — 0-4-Fho: * n the Ho: flux, while [O III] in 
Colbert et al. (2013) and our plot refers to the [O III] 5007 line only. 

the pre-calibration result to Colbert et al. (2013), and 
this discrepancy should not affect our results since [O III] 
emitters at such high fluxes should be easily observed by 
PFS and WFIRST, regardless of the effective radius. 

Recently, the Cosmic Assembly Near-Infrared, Deep 
Extragalactic Legacy Survey (CANDELS) (Galametz 
et al. 2013; Guo et al. 2013) released 0.1 deg 2 of multi¬ 
wavelength data. This data revealed that the CMC has 
an excess of galaxies with photometric measurements 
i — H > 1 not present in the CANDELS data. This 
discrepancy is mostly due to extrapolations of photom¬ 
etry for very faint objects, as well as possibly incor¬ 
rect spectral and photometric templates. We find that 
many galaxies in the CMC that comprise the PFS and 
WFIRST surveys are given i and H band measurements 
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with i — H > 1. Specifically, 29% of PFS galaxies, 92% 
of WFIRST Ha galaxies, and 89% of [O III] galaxies in 
the CMC are in this region in color space, implying that 
our photometric tests and simulations for these surveys 
may be too optimistic (i.e. based on simulated galaxies 
that have stronger Balmer breaks than the real galax¬ 
ies). Fixing this discrepancy between the mock catalog 
and the observed population of galaxies will be the sub¬ 
ject of ongoing work, as the mock catalogs used in LSS 
survey forecasting have undergone continuous improve¬ 
ment over the years. For the purposes of this paper, we 
merely note the existence of this issue and comment later 
on its possible implications. 

4.1 PFS mock survey 

The PFS survey, which has a spectral range of 0.67/mr < 
A < 1.26/rm, will determine the spectroscopic redshifts of 
[O II] galaxies in the redshift range 0.8 < z < 2.38. The 
[O II] doublet consists of two emission lines with wave¬ 
lengths 3726A and 3729A. The PFS spectral resolution is 
high enough to resolve both lines if they are visible, but 
sky lines and the variable line ratio of the [O II] doublet 
could cause one of the peaks to not be visible. Therefore, 
we must be able to tell if a single line could possibly be 
[O II] emission rather than other emission lines, or inter¬ 
lopers. 

In this analysis, we determine the rate at which inter¬ 
lopers will appear as [O II] emitters in PFS, as well as the 
performance of other strategies to minimize these inter¬ 
loper rates. The interlopers we consider in this study are 
Lya 1216, H/3 4861, [O III] 4959/5007, Ha 6563, [N II] 
6584, and [S II] 6727/6731. Note that PFS will have a 
spectral resolution high enough to resolve between Ha 
and [N II], as well as the [O III] doublet. The PFS sur¬ 
vey will use HSC for target selection of [O II] candidates. 
PFS will only target emitters that pass the following pho¬ 
tometric cuts, 

22.8 < g < 24.2 AND - 0.1 < g - r < 0.3 
ANDNOT (g > 23.6 ANDr — i > 0.3). (25) 

We simulate these cuts for the [O II] emitters and the 
interlopers in the CMC. Fluctuations in the magnitudes 
due to instrumental noise could affect which galaxies pass 
the cuts. We account for this by converting the CMC 
magnitudes to continuum fluxes f u for all the galaxies, 
upon which we simulate a continuum flux error <t(/„) de¬ 
termined from the magnitude depth for each photometric 
band. We use the continuum fluxes to simulate fluctua¬ 
tions in the HSC photometry due to instrumental noise, 
in which we convert the magnitude depth for each HSC 


photometric band into a continuum flux error <7 (/j,). 

We show the doublet and singlet noise curves for PFS 
in Fig. 7. We treat [O II] as a doublet line emission, with 
the signal consisting of the sum of both lines in the dou¬ 
blet. The sky lines in the noise curve for the doublet are 
lower than those for the singlet noise curve because an 
individual sky line can only disrupt one member of the 
doublet, making it likely that the other member could 
still be detectable. 5 A singlet, however, can be disrupted 
by a single sky line. We assume each galaxy has an ex¬ 
ponential profile with a half-light radius r e s — 0.3 arcsec 
for our noise curve, and we neglect any variation of the 
source r e g in the PFS analysis. Note that we treat each 
of the interlopers that are doublets as separate single 
emission lines since either line could interlope [O II] . We 
also require in our analysis that any primary line in a 
secondary line identification, whether an [O II] doublet 
candidate or an interloper, have a SNR > 8.5, while the 
secondary line must have SNR > 4. We also simulate 
flux errors due to the PFS detector noise to account for 
lines with fluxes close to the cutoff that shift above or 
below it. 

The HSC photometry used for target selection can also 
identify catastrophic redshift errors from interlopers. We 
use the CMC to construct the colors g — r, r — i, and i— z 
for the objects that pass the PFS target selection and 
use them to find photometric cuts for redshifts with high 
interloper fractions. Note that these photometric cuts 
are affected by the fluctuations in the HSC photometry 
introduced earlier. 

4.2 WFIRST mock survey 

WFIRST will have a spectral range of 1.35/rm < A < 
1.95/rm for its cosmology survey, allowing it to map Ha 
ELGs over the redshift range 1.05 < z < 2 and [O III] 
(5007A) ELGs over the redshift range 1.7 < z < 2.9. 
Unlike PFS, this mission is space-based, so sky lines will 
not contaminate the signal. While its low noise curve 
will allow the identification of millions of galaxies at high 
redshifts, interlopers will also be more readily detected. 
Also, the WFIRST spectrograph, or grism, will be slit¬ 
less, removing any requirements for target photometry. 

The interlopers we consider for WFIRST are the same 
as for PFS with [O II] being an additional interloper. 
Note that the WFIRST grism’s spectral resolution will 
not be high enough to resolve Ha and [N II] . Thus, for 
both SELGs and interlopers in the WFIRST mock sur¬ 
vey, we will combine the Ha flux with the [N II] flux. 

5 Note that we neglect increased [O ll] redshift errors caused by disrupted 
sky lines and blending of the doublet. 
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Fig. 7. The PFS flux noise curve for emission lines. The upper plot is for 
the [O ll] doublet and the lower plot is for a single interloper line. 


Since the WFIRST forecasts in Spergel et al. (2013) 
assumed a constant ratio Ftui/Fn a = 0.4, we will at¬ 
tempt to match their forecasts by not using the Nil 
fluxes in the CMC but instead setting Fnii = 0.4FH a • We 
also consider the Paschen lines Paa (1.88 pm) and Pa/3 
(1.28 pm), which appear in the infrared. The Paschen 
lines are not simulated in the CMC, so we determine 
their (unextincted) fluxes by scaling them with the Ha 
line fluxes, using the atomic line ratios Paa/Ha=0.119 
and Pa/3/Ha=0.0570 from Appendix B of Dopita & 
Sutherland (2003). 6 We show noise curves for WFIRST 
in Fig. 8. In the WFIRST analysis we take into account 
variations in the noise curve due to r e ff. For the WFIRST 
mock survey, we require a SNR > 7 to detect a primary 
line and SNR > 4 to detect a secondary line. As in the 
PFS mock survey, we also simulate flux errors due to the 
instrument. 

6 We use the Pa-Ha ratios from the low-density limit for T e = 10 4 K, a 
typical value, but the temperature dependence is very shallow. 


z Ha 



A [pm] 


Fig. 8. The WFIRST flux noise curves for emission lines. We plot the 
cases for a point source (solid) as well as for an effective radius of 
r eff = 0-4 arcsec (dashed). 


We also test the use of photometry to remove inter¬ 
lopers from the WFIRST survey. WFIRST has four pho¬ 
tometric bands: F106 (1.06 pm), F129 (1.29 pm), F158 
(1.58 pm), and F184 (1.84 pm). While these bands are 
not used for determining targets for spectroscopy, they 
can still be used to determine locations in color space 
which can correlate with redshift. The first three bands 
correspond to the three near-infrared bands Y, J, and H. 
We also consider using photometry from LSST, which has 
the optical bands g, r, i, z, and y, where the y band has a 
similar wavelength range as WFIRST’s Y band, though 
the Y band has a better response on the red end. We use 
the CMC to construct all the various color combinations 
and continuum fluxes /„ for all the galaxies. 7 We perform 
this exercise to determine photometric cuts within red- 
shift bins of Az = 0.2 that can identify interlopers within 
the Ha and [O III] surveys. As for the PFS mock survey, 
we use the continuum fluxes to simulate fluctuations in 
the WFIRST and LSST photometry due to instrumen¬ 
tal noise. We do not include photometry using the F184 
band because it had not been tabulated in the current 
version of the CMC. 


5 PFS results 

In this section we present results of our interloper con¬ 
tamination study for the PFS mock survey. We identify 
Ha as a secondary emission line for identifying Oil, and 
we determine effective secondary emission lines for po- 

' LSST, Euclid, and WFIRST have slightly different wavelength ranges for 
each of its similar photometric bands. We did not color-correct for these 
differences, but we expect the implications of this approximation to be 
minor. 
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Fig. 9. A histogram of [O ll] emitters detectable by PFS in redshift bins 
Az — 0.01. The dashed curve is the histogram for all [O ll] emitters with 
SNR > 8.5, and the solid curve is the histogram for those with a 
detectable Ha line. The error bars denote variations in the distribution of 
detected [O ll] counts, are determined using simulated flux noise, and are 
slightly shifted for clarity. The decrement in the fraction of [O ll] emitters 
with Ho: lines at 2:011 — 0.825 is due to Ha encountering a sky line at 
Ana — 1.2/xm. 

tential interlopers. We predict that ELG interloper rates 
can be reduced to less than 0.2% using secondary line 
identification and photometry. 

5.1 Secondary line identification for [0 ll] 

We tested several emission lines to determine any that 
could serve as a secondary emission line for [O ll], and 
we found that the best choice by far was the Ha line, 
mainly because it is a much stronger emission line than 
any other lines within PFS’s wavelength range. Since 
the PFS wavelength window ends at 1.26/rm, Ha-[0 ll] 
line identification can only be used for zon < 0.92. Over 
the range 0.8 < zon < 0.92, we find that 94% of [O ll] 
emitters in our PFS mock have a detectable Ha line. 
As a function of redshift, we see in Fig. 9 that in most 
redshift bins in this redshift range, most [O ll] lines in 
the PFS mock are identified using the Ha line. There is 
a large decrement in the fraction of [O ll] emitters with 
Ha lines at zon = 0.825, due to Ha encountering a sky 
line at AHa — 1.2/zm, which causes the percentage of [O ll] 
emitters paired with Ha to drop to 60%. 

5.2 Secondary line identification for interlopers 

We repeat the exercise from the previous section for po¬ 
tential interlopers of the [O ll] doublet to see if they could 
be identified. For Lya, the PFS photometric cuts alone 
eliminate them from the sample. For the other interlop¬ 


ers, we determine other lines that could help rule them 
out as [O ll] candidates. We list the interlopers with 
their secondary lines in Table 2. In Fig. 10, we plot 
/(Oil — Int, zoii) for all the interlopers. Some of the 
lines, like [O III], can have increased interloper rates be¬ 
cause they tend to have fluxes just below the detection 
threshold, which then shift due to flux noise. The flux 
noise tends to cause more of them to shift above the 
threshold than to shift below the threshold. We also see 
that secondary line identification can eliminate Ha, [S ll], 
and [N ll] entirely. This method also decreases rates for 
[O III] 4959/5007 and H/3 to less than 1% contamination 
over each small redshift bin. Of course, the CMC is not 
perfect, and the luminosity functions on which this cat¬ 
alog is based, including for Ha and [O ll], are uncertain. 
Overall, it seems that the remaining interlopers that may 
contaminate the PFS results are [O III] and H/3 at [O ll] 
redshifts zon > 2. 

We reduce the [O III] emission lines further by using 
secondary line identification for the doublet with a SNR 
cutoff of Q c = 1 (see Sec. 3.1). We also include the effect 
from [O ll] lines being identified as [O III] interlopers due 
to a statistical fluctuation appearing as an [O III] doublet 
partner. We model the contribution of fluctuations from 
the continuum by estimating according to our formalism 
that up to P 2 = 16% of all [O ll] lines will appear to be 
[O III] lines and be cut due to our low cutoff for the [O III] 
4959 line. We still find, however, that this secondary line 
identification scheme allows us to identify [O III] 5007 
interlopers in our mock PFS survey. Specifically, the re¬ 
sulting [O III] 5007 interloper fraction is less than 0.2%, 
except in the highest redshift bin (2011 = 2.35) where the 
fraction is slightly above 0.2%. Thus, we conclude that 
[O III] interlopers should be removable for PFS using sec¬ 
ondary line identification. 

We also consider how our PFS results are affected if 
any PFS target selection cuts (see Eq. 25 were removed. 
We find that if only the 22.8 < g < 24.2 cut is kept, the 
interloper fractions increase well past our 0.2% target. 
This is the case particularly for [O III] interlopers, which 
exhibit a contamination level of / = 10% in this scenario. 

5.3 Photometry 

H/3 is the remaining interloper in the PFS survey with an 
interloper fraction /(zon ~ 2) ~ 0.6%, and we attempt to 
use HSC photometry to remove these interlopers. For H/3 
interlopers in the range zon = 1.9-2.1 (zup = 1.22-1.38), 
we find these interlopers fail the photometric cut i — z < 
0.67 (r — i) +0.167. Based on the CMC, this cut removes 
only ~ 2% of [O ll] emitters, while not identifying only 
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Table 2. Potential interlopers with the secondary emission lines used to identify them. 


Line 

Secondary (PFS [O Ii] ) 

Secondary (WFIRST Ha) 

Secondary (WFIRST [O III] ) 

Ly a 

N/A 

N/A 

N/A 

[O II] 

N/A 

[O ill],H/3 

[O hi], H/3 

H/3 

Ha,[0 Ii] 

[O m] 

[0 iii],[0 ii] 

[O m] 

[O ii] 

[O II], Ha,H/3 

N/A 

Ha 

[O ii],[O ill] 

N/A 

[0 m],[S ii] 

[N II] 

[O ii],[O ill] 

N/A 

[0 m],[S ii] 

[S II] 

Ha 

Ha 

Ha,[0 III] 



















































16 


Publications of the Astronomical Society of Japan, (2014), Vol. 00, No. 0 



Fig. 11. A histogram of /(Oil — Hfi, 2 : 011 ) in redshift bins Az = 0.1 for 
the PFS survey. The dashed, blue curve includes all secondary line 
identification and photometric cuts (represented by the dash-dotted line in 
Fig. 10) while the solid, green line is the same with double the photometric 
errors. The solid, purple line denotes the target interloper fraction 
/ = 0.2%. We see that the increase in interlopers is not enough to cause 
further significant contamination. 

~ 0.1% of H/3 interlopers. Using this cut eliminates the 
remaining H/3 interlopers in this redshift range. We also 
see an interloper rate of 0.3% in the range 2011 = 2.2— 
2.3 (zh /3 = 1.45-1.53); however, we were unable to find a 
satisfactory photometric cut in this redshift range using 
HSC bands. 

It is also likely that the Balmer + 4000 A break feature 
is weaker in the real galaxies than in the CMC (see §4). 
To test our sensitivity to this effect, we repeat our analy¬ 
sis of the H/3 photometric cut while doubling the errors in 
all the photometric bands (this is a crude proxy for halv¬ 
ing the break stength). The result is shown in Fig. 11, 
where we see that the number of interlopers does increase, 
but the contamination level is still less than 0.2% at most 
redshifts. 

6 WFIRST results 

In this section we determine the interloper rates for 
WFIRST’s Ha and [O III] (5007A) surveys. It does ap¬ 
pear that WFIRST could potentially exhibit greater in¬ 
terloper contamination than PFS. After implementing 
secondary line identification, the main interlopers for the 
surveys are [O II], [O III] (5007A), Ha, and the Paschen 
lines. We do find that the photometric bands from LSST 
and WFIRST can reduce the remaining interlopers in the 
Ha survey to less than 0.2% at all redshifts. However, 
the [O III] survey still exhibits interloper rates of up to 
a few percent even after photometric cuts. The funda¬ 



Fig. 12. A histogram of Ha emitters detectable by WFIRST in redshift 
bins A 2 = 0.025. The dashed curve is the histogram for all Ha emitters 
with SNR > 7, and the solid curve is the histogram for those with a 

visible [O III] (5007A) I ine. The error bars denote variations in the 
distribution of detected Ha counts determined using simulated flux noise. 

mental reason for this appears to be that at WFIRST 
flux levels, there are many more Ha emitters than [O III] 
emitters, hence the contamination of the [O III] sample 
by Ha can be significant even if secondary line identi¬ 
fication and photometric cuts exclude most of the Ha 
contaminants. 

6.1 Ha survey 

Our attempt to find a candidate for the secondary 
line identification of Ha, whose redshift range for the 
WFIRST survey is 1 < z h q < 2, was unsuccessful. [O III] 
(5007A) is the best candidate in that it appears with 
Ha over the longest spectral range, specifically over 
zho >1.7, and [O III] is one of the brightest emission 
lines. However, only 43% of Ha lines with zua > 1.7 
detected by WFIRST will have a corresponding [O III] 
line with SNR > 4 within the WFIRST spectral range. 
We plot in Fig. 12 a comparison of number counts for 
detected Ha lines with and without secondary line iden¬ 
tification. Thus, we will not attempt to use secondary 
line identification to confirm Ha lines, but instead as¬ 
sume all single lines are Ha unless we can prove they are 
interlopers. 

We also find emission lines that serve as efficient sec¬ 
ondary lines for potential interlopers, which are listed in 
Table 2. As with the PFS survey, Lya is below the flux 
cut for WFIRST. The interloper rates for the remain¬ 
ing lines are shown in Fig. 13. We see that although 
secondary line identification eliminates most of the inter¬ 
loper fractions down to < 1%, [O III] and [O II] appear 
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Fig. 13. A histogram of /(Ha — X,zna) in redshift bins A z = 0.1 for the WFIRST Ho: survey. The dashed, blue curve is the histogram for all interlopers 
with SNR > 7, and the solid, green curve is the histogram for those that cannot be ruled out as an Ho: candidate using secondary line identification. The 
dotted, cyan line in the [0 III] 5007 plot is the interloper rate after implementing secondary line identification using the [O III] 4959 line. The dash-dotted, 
red line includes photometric cuts described in Sec. 6.3. The error bars denote variations in the distribution of interloper fractions determined using 
simulated flux noise. The solid, purple line denotes the target interloper fraction / = 0.2%. Note that the interlopers that are members of doublets are 

labeled a and b which denote the shorter and longer wavelength line, respectively. 


to remain with high interloper rates of around 30%, al¬ 
though we are able to reduce [O III] down to 10% by 
identifying [O III] 5007 using the [O III] 4959 line (see 
Sec. 5.2. The limitation in eliminating these interlopers 
is partially due to the limited spectral range of WFIRST, 
which limits the use of Ha to identify [O III] and [O III] 
to identify [O II]. The Paschen lines Paa and Pa/3 have 
no viable candidates for secondary line identification be¬ 
cause their wavelengths are too far apart from other lines 
and each other. 


6.2 0111 survey 

The WFIRST [O III] (5007A) survey, which has a redshift 
range of 1.7 < zoiii < 2.9, can use the [O III] 4959 line 
(SNR > 1) for secondary line identification. As seen in 


Fig. 14, using the [O III] 4959 line keeps almost all (96%) 
of the [O III] 5007 sources. 

Using the [O III] 4959 line to identify [O III] 5007 will 
not eliminate all interlopers perfectly; random fluctua¬ 
tions from the continuum at the right wavelength can 
cause an interloper to appear to have a corresponding 
[O III] 4959 line. As before (see Sec. 3.1), the CMC does 
not simulate fluctuations from the continuum, so we have 
to estimate how often a false [O III] 4959 line will appear. 
Based on our formalism, we expect that only P 2 = 16% 
of interlopers will have a corresponding [O III] 4959 line 
for Q c = 1 due to statistical fluctuations, so we multi¬ 
ply the interloper number counts in our analysis by P 2 - 
After testing potential interlopers for secondary line can¬ 
didates, we find that [O II], Ha, and the Paschen lines are 
the major interlopers left after secondary line identifica- 
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Fig. 14. A histogram of [O III] (5007A) emitters detectable by WFIRST in 
redshift bins Az = 0.025. The dashed curve is the histogram for all [O III] 
emitters with SNR > 7, and the solid curve is the histogram for those 
with an [O III] 4959 line with SNR > 1. The error bars denote variations 
in the distribution of detected [O III] counts determined using simulated 
flux noise. 

tion. The interlopers’ secondary lines are listed in Table 
2, and the interloper rates are shown in Fig. 15. Paa does 
not become a problem until zoiii > 2.7, so we may be able 
to eliminate this interloper by using secondary line iden¬ 
tification for [O III] 5007 using [O III] 4959. The other 
three interlopers appear to have high interloper rates, 
particularly at the lower redshifts for Ha and Pa/3 and 
higher redshifts for [O II]. 

For WFIRST, we see that Fla and [O III] could heav¬ 
ily contaminate each others’ surveys even after secondary 
line identification, and [O II] Paa, and Pa/3 could con¬ 
taminate both surveys. It appears that better methods 
will be required to reduce these interlopers. 

6.3 Photometry 

We attempt to use photometry to eliminate more inter¬ 
lopers from the WFIRST surveys. Since LSST should be 
online long before WFIRST, we explore the use of both 
WFIRST and LSST photometric bands to remove the 
interlopers remaining after secondary line identification. 
In Tables 3 and 4, we list the best photometric cuts to 
identify [O III], [O II], [S II], Paa, and Pa/3 interlopers 
in the Ha survey and Ha, [O II], [S II], Paa, and Pa/3 
interlopers in the [O III] survey. We also consider, based 
on the CMC, how often true SELGs fail the cuts and in¬ 
terlopers pass the cuts. The error rates are all less than 
5%, with most less than 1%. 

When these photometric cuts are implemented for in¬ 
terlopers in the Ha survey, all interloper rates decrease 
to less than 0.2%, satisfying our requirement, except for 


Paa, which still has a 1% interloper fraction in the high¬ 
est redshift bin. For the [O III] survey, we find that Ha 
and Pa/3 are still significant interlopers, with interloper 
rates due to Ha greater than 1% at some redshifts. In the 
same manner as for PFS, we consider the effect of dou¬ 
bling the photometric errors on these cuts, in order to 
consider the effects of breaks that are weaker than those 
in the CMC. The interloper rates for [O II] and [O III] in 
the Ha survey and for [O II] and Ha in the [O III] sur¬ 
vey increase significantly, as we see in Fig. 16, with [O II] 
in the Ha survey passing the 0.2% level and Ha in the 
[O III] survey even reaching 10% contamination at some 
redshifts. 

Part of the reason Ha is such a large contaminant 
is that Ha-emitters are the dominant population in the 
WFIRST survey, as by design, so a very large number of 
Ha interlopers need to be eliminated to reduce the con¬ 
tamination level to less than 0.2%. Also, it can be shown 
that for 2.3 < zom < 2.7, the [O III] color locus for the 
relevant photometric cut is close enough to the Ha in¬ 
terloper color locus that random fluctuations cause them 
to overlap, greatly reducing the power of the cut. On 
the extreme ends of the [O III] survey redshift range, the 
color loci are farther apart, which reduces the popula¬ 
tion of Ha interlopers considerably. Ha interlopers could 
be mitigated somewhat by identifying them in the Ha 
survey, but this would inevitably cause some [O III] lines 
misidentified as Ha lines to be removed. For these cases, 
a deep spectroscopic training samples may be required to 
accurately measure the interloper fraction. 

6.4 Y-band Photometry 

Most of the photometric cuts needed to identify interlop¬ 
ers require the Y band, a fairly new photometric band for 
LSS surveys. To motivate the required Y-band depth for 
these surveys, we determine the distribution of Y-band 
magnitudes expected in the WFIRST survey, both for the 
SELGs and the major contaminants that are not removed 
through secondary line identification. In our analysis we 
simulate fluctuations in the WFIRST Y-band magnitude 
by converting the magnitude depth of the band (Y = 26.7 
for point sources, 5<r) into a continuum flux error. The 
fluctuations are then added to the continuum Y-band 
fluxes of the emitters and converted back to apparent 
magnitudes. 

We plot the Y-band magnitude distributions in 
Fig. 17. It can be seen that all of the galaxies and in¬ 
terlopers will have a Y-band magnitude Y < 26.7. While 
a deeper Y survey would be preferable, WFIRST in its 
current configuration should be able to determine the Y 
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Fig. 15. A histogram of/(OIII — X, 2 : 0111 ) in redshift bins Az = 0.1 for the WFIRST ONI survey. The format of the figure is similar to Fig. 13. 


magnitude for most of the galaxies (SELGs and interlop¬ 
ers) in the survey. Note that the LSST (y < 25) or Euclid 
(Y < 24) photometry in this band would miss a signifi¬ 
cant number of the source galaxies, and so at present it 
appears there is a need for the WFIRST Y -band photom¬ 
etry in order to carry out the galaxy redshift survey pro¬ 
gram. However, the WFIRST survey will require LSST 
optical bands to implement many of the other photomet¬ 
ric cuts needed to remove interlopers. 

7 Conclusions 

We predict the effects of interlopers on cosmological pa¬ 
rameter estimates from upcoming and future LSS sur¬ 
veys. We construct a formalism describing our interloper 
contamination biases estimates of the power spectrum 
as well as cosmological parameter estimates. Interloper 
fractions / > 0.5% can bias the measured power spectrum 
from LSS surveys, particularly on smaller scales, while 
/ > 0.2% will significantly increase error estimates. Also, 


contamination levels greater than 0.15-0.3% can signifi¬ 
cantly bias the growth rate measurements from upcoming 
surveys. 

Using the CMC, we estimate the amount of interloper 
contamination that can occur in upcoming surveys. Note 
again that our analysis with the CMC is idealized in that 
it does not treat blended objects and is limited by the 
bank of SED templates in the catalog, but it represents 
a good “first look” at how serious the problem might be, 
and what mitigation strategies might be employed. In 
our estimates we simulate the interloper cleaning meth¬ 
ods of secondary line identification and photometric iden¬ 
tification. While we expected secondary line identifica¬ 
tion to be effective, we find specifically that this method 
can potentially eliminate most interlopers from the PFS 
survey, but it will leave large interloper fractions in the 
future WFIRST survey. We find photometric cuts are 
well able to identify interlopers in WFIRST. We pre¬ 
dict that WFIRST can exhibit contamination levels up to 
30% using only secondary line identification, which would 
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Table 3. Photometric cuts for WFIRST Ha survey as a function of redshift. 


ZHa 

OIII 5007 photometric cut 

Oil photometric cut 

0.9-1.1 

Y — J < 1.02(i — z) + 0.11 

r — i> 0.52 (g — r) + 0.052 

1.1-1.3 

Y-J <l.l(i-z) + 0.15 

Y — J < 1.4(i — z) — 0.2 

1.3-1.5 

Y- J < 1.935(z-Y) +0.161 

i — z> 0.36(p — r) — 0.02 
&r — i > 0.56(g — r) — 0.12 

1.5-1.7 

T TT f 4.2(z-Y)-0.1 if z — Y < 0.2; 

\ 0.6(z — Y) + 0.7 if z — Y > 0.2 

r — i> 0.68 (g — r) — 0.364 

1.7-1.9 

J H < 0.714(Y - J) + 0.071 

r — i> 0.62 (g — r) — 0.438 

1.9-2.0 

J-H <Y -J + 0.1 

r-i > 0.75(3-r)-0.75 


SII 6731 photometric cut (zh q = 0.9 — 1.1) 

Paa photometric cut (zjj q = 1.8 — 2.0) 


J — H > 0.91(Y — J) — 0.228 

Y — J > 2.33(z — Y) + 0.05 


Pa/3 photometric cut 

0.9-1.1 

Y — J < 0.89(* — z) 


1.1-1.3 

i — z> 0.583(3 — r) 


1.3-1.5 

i — z > 0.3 (g — r) +0.15 


1.5-1.7 

z — Y > 0.167(r — j) + 0.033 


1.7-1.9 

Y — J > 2.0(i — z) — 0.1 


1.9-2.0 

Y — J > 1.9(* — z) 



Table 4. Photometric cuts for WFIRST Olll survey as a function of redshift. 


^OIII 

Ha photometric cut 

Oil photometric cut 

1.7-1.9 

Y- J> 1.5(*-z) 

J — H < 0.94(Y — J) + 0.04 

1.9-2.1 

Y - J > 2.35(z - Y) + 0.2175 

J-H < 1.11(Y — J) — 0.1 

2.1-2.3 

Y — J > 3.0(z — Y) + 0.1 

r-i> 0.58(3-r)-0.16 

2.3-2.5 

J H > 6.06(z — Y) — 0.621 

i — z> 0.4(3 — r) — 0.2 

2.5-2.7 

J — H > 3.5(z — Y) 

*-z> 0.42(3 -r) -0.368 

2.7-2.9 

J — H > 0.947(Y - J) - 0.226 

i — z > 0.4(3 — r) — 0.4 


SII 6731 photometric cut (zoiii = 1.7— 1.9) 

Paa photometric cut (zoiii = 2.7 — 2.9) 


Y- J> 1.625(i-z) + 0.1 

J — H > 0.81(Y — J) + 0.16 


Pa/3 photometric cut 

1.7-1.9 

Y — J > 1.5(£ — z) + 0.2 


1.9-2.1 

Y — J > 2.556(z - Y) + 0.2556 


2.1-2.3 

Y — J > 3.25(z — Y) + 0.2 


2.3-2.5 

z - Y > 0.714(r - *) - 0.1572 


2.5-2.7 

z — Y > 0.25(* — z) — 0.05 


2.7-2.9 

J — H > 1.167(z — Y) +0.1167 
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Fig. 16. Histograms of interloper fractions for the WFIRST Ha and [O III] surveys. In each plot, the dashed, blue curve includes all secondary line 
identification and photometric cuts (represented by the dash-dotted lines in Figs. 13 and 15) while the solid, green line is the same with double the 
photometric errors. The solid, purple line denotes the target interloper fraction / = 0.2%. Also the left column is for the Ha survey, while the right column 
is for the [O III] survey. We see that these interloper rates increase significantly when the photometric errors increase. 



be catastrophic for dark energy measurements, but pho¬ 
tometric cuts can reduce contamination in WFIRST to 
less than 0.2% for most redshifts in the Ha survey. In 
the [O III] survey, large interloper fractions remain after 
photometric cuts. Deep spectroscopic surveys will be re¬ 
quired to measure the significant interloper fractions that 
remain in the PFS and WFIRST surveys, particularly for 
Ha in the [O III] survey if it cannot be readily identified 
from the Ha survey. We caution again that these are 
point estimates based on the current “community stan¬ 
dard” mock catalog, and will evolve as the fidelity of the 
mock catalogs and ultimately the real data from these 
instruments become available. 

These strategies are applicable to many upcoming 
emission line surveys, including Euclid ; however, the ac¬ 
tual implementation will depend heavily on spectroscopic 
and photometric sensitivities and wavelength ranges, as 
well as survey areas which will determine available ancil¬ 


lary photometric surveys. This analysis was performed 
specifically for optical/NIR surveys, but the strategies 
can also be implemented in surveys at other wavelengths, 
e.g. millimeter wavelengths, where there are multiple 
lines and where instrumental or atmospheric transmis¬ 
sion considerations limit the available spectral range. In 
addition, causes of scale-dependent clustering bias such 
as non-Gaussianity and neutrino masses will have vary¬ 
ing sensitivities to the interloper fraction. Future work 
should determine acceptable interloper fractions for other 
cosmological parameters as well as the implementation of 
these strategies for other upcoming surveys. 
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Fig. 17. A histogram of the distribution of T-band magnitudes for the 
WFIRST Ho: survey (top) and the 0111 survey (bottom) plotted in 
T-magnitude bins AT — 0.5. Each panel includes the survey emission line 
galaxies along with contaminants that cannot be identified using secondary 
line identification, requiring photometry to be removed. The error bars 
denote variations in the distribution of T-band magnitudes determined 
using simulated flux noise in the lines and the Y magnitude. We see from 
these plots that T-band photometry must be deeper than 26th magnitude 
in order to measure colors for “almost all" of the sample. 
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